首页> 外文OA文献 >Collective Influence Algorithm to find influencers via optimal percolation in massively large social media
【2h】

Collective Influence Algorithm to find influencers via optimal percolation in massively large social media

机译:集体影响算法通过最优方法找到影响者   大规模社交媒体中的渗透

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

We elaborate on a linear time implementation of the Collective Influence (CI)algorithm introduced by Morone, Makse, Nature 524, 65 (2015) to find theminimal set of influencers in a network via optimal percolation. We show thatthe computational complexity of CI is O(N log N) when removing nodesone-by-one, with N the number of nodes. This is made possible by using anappropriate data structure to process the CI values, and by the finite radius lof the CI sphere. Furthermore, we introduce a simple extension of CI when l isinfinite, the CI propagation (CI_P) algorithm, that considers the globaloptimization of influence via message passing in the whole network andidentifies a slightly smaller fraction of influencers than CI. Remarkably, CI_Pis able to reproduce the exact analytical optimal percolation thresholdobtained by Bau, Wormald, Random Struct. Alg. 21, 397 (2002) for cubic randomregular graphs, leaving little improvement left for random graphs. We alsointroduce the Collective Immunization Belief Propagation algorithm (CI_BP), abelief-propagation (BP) variant of CI based on optimal immunization, which hasthe same performance as CI_P. However, this small augmented performance of theorder of 1-2 % in the low influencers tail comes at the expense of increasingthe computational complexity from O(N log N) to O(N^2 log N), rendering both,CI_P and CI_BP, prohibitive for finding influencers in modern-day big-data. Thesame nonlinear running time drawback pertains to a recently introducedBP-decimation (BPD) algorithm by Mugisha, Zhou, arXiv:1603.05781. For instance,we show that for big-data social networks of typically 200 million users (eg,active Twitter users sending 500 million tweets per day), CI finds theinfluencers in less than 3 hours running on a single CPU, while the BPalgorithms (CI_P, CI_BP and BDP) would take more than 3,000 years to accomplishthe same task.
机译:我们详细阐述了Morone,Makse,Nature 524、65(2015)引入的集体影响力(CI)算法的线性时间实现,以通过最佳渗滤找到网络中的最小影响者集。我们证明,当逐个删除节点时,CI的计算复杂度为O(N log N),其中N为节点数。通过使用适当的数据结构来处理CI值,以及通过CI球面的有限半径l,这可以实现。此外,我们引入了当l为无限时CI的简单扩展,即CI传播(CI_P)算法,该算法考虑了通过整个网络中的消息传递对影响进行全局优化,并确定了比CI稍小的影响者。值得注意的是,CI_Pis能够再现Bau,Wormald,Random Struct获得的精确的分析最佳渗滤阈值。海藻21,397(2002)对于三次随机正则图,对随机图几乎没有改善。我们还介绍了基于最佳免疫的CI的集体免疫信念传播算法(CI_BP),可信度传播(BP)变体,其性能与CI_P相同。然而,低影响者尾部的这种1-2%的小幅度增强性能是以从O(N log N)到O(N ^ 2 log N)的计算复杂度增加为代价的,同时呈现CI_P和CI_BP,禁止在当今的大数据中寻找影响者。相同的非线性运行时间缺点与Mugisha,Zhou,arXiv:1603.05781最近引入的BP抽取(BPD)算法有关。例如,我们表明,对于通常有2亿用户的大数据社交网络(例如,活跃的Twitter用户每天发送5亿条推文),CI会在单个CPU上运行少于3个小时的时间找到影响者,而BP算法(CI_P ,CI_BP和BDP)需要3,000多年才能完成相同的任务。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号